Search CORE

179 research outputs found

Fast Scalable Construction of (Minimal Perfect Hash) Functions

Author: A Goerdt
AM Frieze
AM Odlyzko
BA LaMacchia
BS Majewski
D Belazzougui
D Belazzougui
D Belazzougui
D Belazzougui
FC Botelho
M Aumüller
M Dietzfelbinger
M Dietzfelbinger
N Fountoulakis
Publication venue
Publication date: 22/03/2016
Field of study

Recent advances in random linear systems on finite fields have paved the way for the construction of constant-time data structures representing static functions and minimal perfect hash functions using less space with respect to existing techniques. The main obstruction for any practical application of these results is the cubic-time Gaussian elimination required to solve these linear systems: despite they can be made very small, the computation is still too slow to be feasible. In this paper we describe in detail a number of heuristics and programming techniques to speed up the resolution of these systems by several orders of magnitude, making the overall construction competitive with the standard and widely used MWHC technique, which is based on hypergraph peeling. In particular, we introduce broadword programming techniques for fast equation manipulation and a lazy Gaussian elimination algorithm. We also describe a number of technical improvements to the data structure which further reduce space usage and improve lookup speed. Our implementation of these techniques yields a minimal perfect hash function data structure occupying 2.24 bits per element, compared to 2.68 for MWHC-based ones, and a static function data structure which reduces the multiplicative overhead from 1.23 to 1.03

arXiv.org e-Print Archive

Crossref

On the probability of rendezvous in graphs

Author: Dietzfelbinger M.
Tamaki H.
Publication venue: Max-Planck-Institut für Informatik
Publication date: 01/01/2003
Field of study

In a simple graph

G

without isolated nodes the following random experiment is carried out: each node chooses one of its neighbors uniformly at random. We say a rendezvous occurs if there are adjacent nodes

u

and

v

such that

u

chooses

v

and

v

chooses

u

; the probability that this happens is denoted by

s(G)

. M{\'e}tivier \emph{et al.} (2000) asked whether it is true that

s(G)\ge s(K_n)

for all

n

-node graphs

G

, where

K_n

is the complete graph on

n

nodes. We show that this is the case. Moreover, we show that evaluating

s(G)

for a given graph

G

is a \numberP-complete problem, even if only

d

-regular graphs are considered, for any

d\ge5

MPG.PuRe

Quicksort, Largest Bucket, and Min-Wise Hashing with Limited Independence

Author: A. Siegel
H. Karloff
J.L. Carter
J.P. Schmidt
M. Dietzfelbinger
M. Pǎtraşcu
R. Motwani
T. Christiani
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Randomized algorithms and data structures are often analyzed under the assumption of access to a perfect source of randomness. The most fundamental metric used to measure how "random" a hash function or a random number generator is, is its independence: a sequence of random variables is said to be

k

-independent if every variable is uniform and every size

k

subset is independent. In this paper we consider three classic algorithms under limited independence. We provide new bounds for randomized quicksort, min-wise hashing and largest bucket size under limited independence. Our results can be summarized as follows. -Randomized quicksort. When pivot elements are computed using a

5

-independent hash function, Karloff and Raghavan, J.ACM'93 showed

O ( n \log n)

expected worst-case running time for a special version of quicksort. We improve upon this, showing that the same running time is achieved with only

4

-independence. -Min-wise hashing. For a set

A

, consider the probability of a particular element being mapped to the smallest hash value. It is known that

5

-independence implies the optimal probability

O (1 /n)

. Broder et al., STOC'98 showed that

2

-independence implies it is

O(1 / \sqrt{|A|})

. We show a matching lower bound as well as new tight bounds for

3

- and

4

-independent hash functions. -Largest bucket. We consider the case where

n

balls are distributed to

n

buckets using a

k

-independent hash function and analyze the largest bucket size. Alon et. al, STOC'97 showed that there exists a

2

-independent hash function implying a bucket of size

\Omega ( n^{1/2})

. We generalize the bound, providing a

k

-independent family of functions that imply size

\Omega ( n^{1/k})

.Comment: Submitted to ICALP 201

arXiv.org e-Print Archive

Crossref

Copenhagen University Research Information System

The IT University of Copenhagen's Repository

Improved Approximate String Matching and Regular Expression Matching on Ziv-Lempel Compressed Texts

Author: A. Amir
E.W. Myers
G. Navarro
G. Navarro
G. Navarro
G.M. Landau
J. Kärkkäinen
J. Ziv
J. Ziv
K. Thompson
M. Dietzfelbinger
M. Farach
P. Sellers
R. Cole
T.A. Welch
V. Mäkinen
Publication venue
Publication date: 01/01/2007
Field of study

We study the approximate string matching and regular expression matching problem for the case when the text to be searched is compressed with the Ziv-Lempel adaptive dictionary compression schemes. We present a time-space trade-off that leads to algorithms improving the previously known complexities for both problems. In particular, we significantly improve the space bounds, which in practical applications are likely to be a bottleneck

arXiv.org e-Print Archive

CiteSeerX

Crossref

University of Southern Denmark Research Output

Online Research Database In Technology

A Reconfigurations Analogue of Brooks’ Theorem

Author: Csuhaj-Varjú Ersébet
Dietzfelbinger Martin
Feghali C.
Johnson M.
Paulusma D.
Ésik Zoltán
Publication venue
Publication date: 01/01/2014
Field of study

Let G be a simple undirected graph on n vertices with maximum degree Δ. Brooks’ Theorem states that G has a Δ-colouring unless G is a complete graph, or a cycle with an odd number of vertices. To recolour G is to obtain a new proper colouring by changing the colour of one vertex. We show that from a k-colouring, k > Δ, a Δ-colouring of G can be obtained by a sequence of O(n 2) recolourings using only the original k colours unless G is a complete graph or a cycle with an odd number of vertices, or k = Δ + 1, G is Δ-regular and, for each vertex v in G, no two neighbours of v are coloured alike. We use this result to study the reconfiguration graph R k (G) of the k-colourings of G. The vertex set of R k (G) is the set of all possible k-colourings of G and two colourings are adjacent if they differ on exactly one vertex. It is known that if k ≤ Δ(G), then R k (G) might not be connected and it is possible that its connected components have superpolynomial diameter, if k ≥ Δ(G) + 2, then R k (G) is connected and has diameter O(n 2). We complete this structural classification by settling the missing case: if k = Δ(G) + 1, then R k (G) consists of isolated vertices and at most one further component which has diameter O(n 2). We also describe completely the computational complexity classification of the problem of deciding whether two k-colourings of a graph G of maximum degree Δ belong to the same component of R k (G) by settling the case k = Δ(G) + 1. The problem is O(n 2) time solvable for k = 3, PSPACE-complete for 4 ≤ k ≤ Δ(G), O(n) time solvable for k = Δ(G) + 1, O(1) time solvable for k ≥ Δ(G) + 2 (the answer is always yes)

Durham Research Online

Knocking Out P_k-free Graphs

Author: Csuhaj-Varjú Ersébet
Dietzfelbinger Martin
Johnson M.
Paulusma D.
Stewart A.
Ésik Zoltán
Publication venue
Publication date: 01/01/2014
Field of study

A parallel knock-out scheme for a graph proceeds in rounds in each of which each surviving vertex eliminates one of its surviving neighbours. A graph is KO-reducible if there exists such a scheme that eliminates every vertex in the graph. The Parallel Knock-Out problem is to decide whether a graph G is KO-reducible. This problem is known to be NP-complete and has been studied for several graph classes since MFCS 2004. We show that the problem is NP-complete even for split graphs, a subclass of P 5-free graphs. In contrast, our main result is that it is linear-time solvable for P 4-free graphs (cographs)

Durham Research Online

Matchings on infinite graphs

Author: C Bordenave
C Bordenave
CD Godsil
Charles Bordenave
D Aldous
DJ Aldous
G Elek
G Elek
J Aronson
Justin Salez
M Dietzfelbinger
Marc Lelarge
OJ Heilmann
R Lyons
T Bohman
Publication venue
Publication date: 01/01/2011
Field of study

Elek and Lippner (2010) showed that the convergence of a sequence of bounded-degree graphs implies the existence of a limit for the proportion of vertices covered by a maximum matching. We provide a characterization of the limiting parameter via a local recursion defined directly on the limit of the graph sequence. Interestingly, the recursion may admit multiple solutions, implying non-trivial long-range dependencies between the covered vertices. We overcome this lack of correlation decay by introducing a perturbative parameter (temperature), which we let progressively go to zero. This allows us to uniquely identify the correct solution. In the important case where the graph limit is a unimodular Galton-Watson tree, the recursion simplifies into a distributional equation that can be solved explicitly, leading to a new asymptotic formula that considerably extends the well-known one by Karp and Sipser for Erd\"os-R\'enyi random graphs.Comment: 23 page

arXiv.org e-Print Archive

CiteSeerX

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

INRIA a CCSD electronic archive server

HAL-INSA Toulouse

Hal-Diderot

Forbidden Induced Subgraphs and the Price of Connectivity for Feedback Vertex Set

Author: Belmonte R.
Csuhaj-Varjú Ersébet
Dietzfelbinger Martin
Hof van 't P.
Kaminski M.
Paulusma D.
Ésik Zoltán
Publication venue
Publication date: 01/01/2014
Field of study

Let fvs(G) and cfvs(G) denote the cardinalities of a minimum feedback vertex set and a minimum connected feedback vertex set of a graph G, respectively. For a graph class G, the price of connectivity for feedback vertex set (poc-fvs) for G is defined as the maximum ratio cfvs(G)/fvs(G) over all connected graphs G in G. It is known that the poc-fvs for general graphs is unbounded. We study the poc-fvs for graph classes defined by a finite family H of forbidden induced subgraphs. We characterize exactly those finite families H for which the poc-fvs for H-free graphs is bounded by a constant. Prior to our work, such a result was only known for the case where |H|=1

Durham Research Online

Wear Minimization for Cuckoo Hashing: How Not to Throw a Lot of Eggs into One Basket

Author: A. Ben-Aroya
A. Kirsch
A.M. Frieze
A.M. Frieze
D. Fotakis
E. Lehman
H.-S.P. Wong
J. Schmidt-Pruzan
L. Devroye
M. Dietzfelbinger
M. Karoński
P. Pavan
R. Bez
R. Pagh
S. Irani
Y. Arbitman
Y. Azar
Y.-H. Chang
Publication venue
Publication date: 01/01/2014
Field of study

We study wear-leveling techniques for cuckoo hashing, showing that it is possible to achieve a memory wear bound of

\log\log n+O(1)

after the insertion of

n

items into a table of size

Cn

for a suitable constant

C

using cuckoo hashing. Moreover, we study our cuckoo hashing method empirically, showing that it significantly improves on the memory wear performance for classic cuckoo hashing and linear probing in practice.Comment: 13 pages, 1 table, 7 figures; to appear at the 13th Symposium on Experimental Algorithms (SEA 2014

arXiv.org e-Print Archive

Crossref

Обзор подходов к автоматизации управленческой деятельности

Author: A.A. Razborov
B. Kalyanasundaram
I.I. Macarie
J. Hromkovič
J. Hromkovič
M. Dietzfelbinger
M. Sauerhoff
N. Immermann
P. Ďuriš
R. Freivalds
R. Szelepcsěnyi
Publication venue
Publication date: 01/01/2003
Field of study

Electronic archive of Tomsk Polytechnic University

Crossref